11 research outputs found

    Gis-based gully erosion susceptibility mapping: a comparison of computational ensemble data mining models

    Get PDF
    Gully erosion destroys agricultural and domestic grazing land in many countries, especially those with arid and semi-arid climates and easily eroded rocks and soils. It also generates large amounts of sediment that can adversely impact downstream river channels. The main objective of this research is to accurately detect and predict areas prone to gully erosion. In this paper, we couple hybrid models of a commonly used base classifier (reduced pruning error tree, REPTree) with AdaBoost (AB), bagging (Bag), and random subspace (RS) algorithms to create gully erosion susceptibility maps for a sub-basin of the Shoor River watershed in northwestern Iran. We compare the performance of these models in terms of their ability to predict gully erosion and discuss their potential use in other arid and semi-arid areas. Our database comprises 242 gully erosion locations, which we randomly divided into training and testing sets with a ratio of 70/30. Based on expert knowledge and analysis of aerial photographs and satellite images, we selected 12 conditioning factors for gully erosion. We used multi-collinearity statistical techniques in the modeling process, and checked model performance using statistical indexes including precision, recall, F-measure, Matthew correlation coefficient (MCC), receiver operatic characteristic curve (ROC), precision-recall graph (PRC), Kappa, root mean square error (RMSE), relative absolute error (PRSE), mean absolute error (MAE), and relative absolute error (RAE). Results show that rainfall, elevation, and river density are the most important factors for gully erosion susceptibility mapping in the study area. All three hybrid models that we tested significantly enhanced and improved the predictive power of REPTree (AUC=0.800), but the RS-REPTree (AUC= 0.860) ensemble model outperformed the Bag-REPTree (AUC= 0.841) and the AB-REPTree (AUC= 0.805) models. We suggest that decision makers, planners, and environmental engineers employ the RS-REPTree hybrid model to better manage gully erosion-prone areas in Iran

    Combination four different ensemble algorithms with the generalized linear model (GLM) for predicting forest fire susceptibility

    No full text
    In this study, the generalized linear model (GLM) and four ensemble methods (partial least squares (PLS), boosting, bagging, and Bayesian) were applied to predict forest fire hazard in the Chalus Rood watershed in the Mazandaran Province, Iran. Data from 108 historical forest fire events collected through field surveys were applied as the basis of the analysis. About 70% of the data were used for training the models, while the remaining 30% was used for testing. A total of 14 environmental, climatic, and vegetation variables were used as input features to the models to predict forest fire probability. After conducting a multicollinearity test on the independent variables, the GLM and the ensemble models were applied for modeling. The efficiency of the models was evaluated using receiver operating characteristic (ROC) curve parameters. Results from the validation process, based on the area under the ROC curve (AUC), showed that the GLM, PLS-GLM, boosted-GLM, Bagging-GLM, and Bayesian-GLM models had efficiencies of 0.79, 0.75, 0.81, 0.84, and 0.85, respectively. The results indicated that all ensemble methods, except the PLS algorithm, improved the performance of the GLM model in modeling forest fire hazards in the Chalus Rood watershed, with the Bayesian algorithm being the most efficient method among them

    Novel Ensemble Approach of Deep Learning Neural Network (DLNN) Model and Particle Swarm Optimization (PSO) Algorithm for Prediction of Gully Erosion Susceptibility

    No full text
    This study aims to evaluate a new approach in modeling gully erosion susceptibility (GES) based on a deep learning neural network (DLNN) model and an ensemble particle swarm optimization (PSO) algorithm with DLNN (PSO-DLNN), comparing these approaches with common artificial neural network (ANN) and support vector machine (SVM) models in Shirahan watershed, Iran. For this purpose, 13 independent variables affecting GES in the study area, namely, altitude, slope, aspect, plan curvature, profile curvature, drainage density, distance from a river, land use, soil, lithology, rainfall, stream power index (SPI), and topographic wetness index (TWI), were prepared. A total of 132 gully erosion locations were identified during field visits. To implement the proposed model, the dataset was divided into the two categories of training (70%) and testing (30%). The results indicate that the area under the curve (AUC) value from receiver operating characteristic (ROC) considering the testing datasets of PSO-DLNN is 0.89, which indicates superb accuracy. The rest of the models are associated with optimal accuracy and have similar results to the PSO-DLNN model; the AUC values from ROC of DLNN, SVM, and ANN for the testing datasets are 0.87, 0.85, and 0.84, respectively. The efficiency of the proposed model in terms of prediction of GES was increased. Therefore, it can be concluded that the DLNN model and its ensemble with the PSO algorithm can be used as a novel and practical method to predict gully erosion susceptibility, which can help planners and managers to manage and reduce the risk of this phenomenon

    Evaluating the Efficiency of Different Regression, Decision Tree, and Bayesian Machine Learning Algorithms in Spatial Piping Erosion Susceptibility Using ALOS/PALSAR Data

    No full text
    Piping erosion is one form of water erosion that leads to significant changes in the landscape and environmental degradation. In the present study, we evaluated piping erosion modeling in the Zarandieh watershed of Markazi province in Iran based on random forest (RF), support vector machine (SVM), and Bayesian generalized linear models (Bayesian GLM) machine learning algorithms. For this goal, due to the importance of various geo-environmental and soil properties in the evolution and creation of piping erosion, 18 variables were considered for modeling the piping erosion susceptibility in the Zarandieh watershed. A total of 152 points of piping erosion were recognized in the study area that were divided into training (70%) and validation (30%) for modeling. The area under curve (AUC) was used to assess the effeciency of the RF, SVM, and Bayesian GLM. Piping erosion susceptibility results indicated that all three RF, SVM, and Bayesian GLM models had high efficiency in the testing step, such as the AUC shown with values of 0.9 for RF, 0.88 for SVM, and 0.87 for Bayesian GLM. Altitude, pH, and bulk density were the variables that had the greatest influence on the piping erosion susceptibility in the Zarandieh watershed. This result indicates that geo-environmental and soil chemical variables are accountable for the expansion of piping erosion in the Zarandieh watershed

    Applying different resampling strategies in machine learning models to predict head-cut gully erosion susceptibility

    No full text
    Gully erosion is one of the advanced forms of water erosion. Identifying the effective factors and gully erosion predicting is one of the important tools to control and manage such phenomenon. The main purpose of this study is to evaluate the effect of four different resampling algorithms including cross-validation (5-fold and 10-fold) and bootstrapping (Bootstrap and Optimism bootstrap) on boosted regression tree (BRT), support vector machine (SVM), and random forest (RF) models in spatial modeling and evaluation of head-cut gully erosion in Konduran watershed. For this purpose, based on an extensive field survey, the points of the head-cut of the gully erosion were identified first, and a map of the distribution of head-cut gully erosion in the study area was prepared. Then 18 variable identify and prepare as factors affecting the occurrence of head-cut gully erosion. To assess the efficiency of the models, receiver operating characteristics (ROC) and area under the curve (AUC) were used. The results of the assessment indicated that the use of resampling algorithms increases the efficiency of the models. The integrated optimism-bootstrap-BRT, optimism-bootstrap-SVM, and Optimism-Bootstrap-RF models with AUC 0.85, 0.823 and 0.89 respectively, outperformed the cross-validation 5fold (BRT, SVM, RF), Cross-validation 10fold (BRT, SVM, RF) and Bootstrap (BRT, SVM, RF) integrated algorithms

    Using computational-intelligence algorithms and remote sensing data to optimize the locations of check dams to control sediment and runoff in Kandolus watershed, Mazandaran, Iran

    No full text
    Construction of check dams is one of the important measures in controlling soil erosion and sediment entering the reservoirs of dams. These structures, which provide water reduction and sedimentation by modifying the slope of the canal, play a significant role in preventing the transfer of sediment and pollutants to dam reservoirs, and water bodies. The most challenging task, however, is to identify suitable sites for check dams. In the present study, four machine learning algorithms (namely K-nearest neighbour (KNN), extreme gradient boosting (XGBoost), extremely randomized tree (ERT), and random forest (RF)) and high-resolution remote sensing data were used to find the optimal locations of check dams in the Kandolus watershed in Mazandaran Province (Iran). Sixteen topographical, hydrological, and geomorphic factors were used as inputs in the abovementioned machine learning approaches to identify appropriate locations of check dams. The models were evaluated using receiver operating characteristics (ROC) statistical analyses. The results showed that the RF, ERT, XGB, and KNN models could accurately identify the suitable locations of check dams with the area under curve (AUC) values of 0.93, 0.92, 0.83, and 0.82, respectively. According to the AUC values, RF had the highest accuracy to identify the suitable locations for check dams. This study demonstrate that the utilized artificial intelligence methods and remote sensing data can help land-use planners and water resource managers identify optimal locations of check dams to more efficiency control floods, reduce erosion and land degradation, and enhance groundwater recharge

    Comparative Analysis of Artificial Intelligence Models for Accurate Estimation of Groundwater Nitrate Concentration

    No full text
    Prediction of the groundwater nitrate concentration is of utmost importance for pollution control and water resource management. This research aims to model the spatial groundwater nitrate concentration in the Marvdasht watershed, Iran, based on several artificial intelligence methods of support vector machine (SVM), Cubist, random forest (RF), and Bayesian artificial neural network (Baysia-ANN) machine learning models. For this purpose, 11 independent variables affecting groundwater nitrate changes include elevation, slope, plan curvature, profile curvature, rainfall, piezometric depth, distance from the river, distance from residential, Sodium (Na), Potassium (K), and topographic wetness index (TWI) in the study area were prepared. Nitrate levels were also measured in 67 wells and used as a dependent variable for modeling. Data were divided into two categories of training (70%) and testing (30%) for modeling. The evaluation criteria coefficient of determination (R2), mean absolute error (MAE), root mean square error (RMSE), and Nash–Sutcliffe efficiency (NSE) were used to evaluate the performance of the models used. The results of modeling the susceptibility of groundwater nitrate concentration showed that the RF (R2 = 0.89, RMSE = 4.24, NSE = 0.87) model is better than the other Cubist (R2 = 0.87, RMSE = 5.18, NSE = 0.81), SVM (R2 = 0.74, RMSE = 6.07, NSE = 0.74), Bayesian-ANN (R2 = 0.79, RMSE = 5.91, NSE = 0.75) models. The results of groundwater nitrate concentration zoning in the study area showed that the northern parts of the case study have the highest amount of nitrate, which is higher in these agricultural areas than in other areas. The most important cause of nitrate pollution in these areas is agriculture activities and the use of groundwater to irrigate these crops and the wells close to agricultural areas, which has led to the indiscriminate use of chemical fertilizers by irrigation or rainwater of these fertilizers is washed and penetrates groundwater and pollutes the aquifer

    Flood susceptibility mapping using an improved analytic network process with statistical models

    No full text
    Abstract Flooding is a natural disaster that causes considerable damage to different sectors and severely affects economic and social activities. The city of Saqqez in Iran is susceptible to flooding due to its specific environmental characteristics. Therefore, susceptibility and vulnerability mapping are essential for comprehensive management to reduce the harmful effects of flooding. The primary purpose of this study is to combine the Analytic Network Process (ANP) decision-making method and the statistical models of Frequency Ratio (FR), Evidential Belief Function (EBF), and Ordered Weight Average (OWA) for flood susceptibility mapping in Saqqez City in Kurdistan Province, Iran. The frequency ratio method was used instead of expert opinions to weight the criteria in the ANP. The ten factors influencing flood susceptibility in the study area are slope, rainfall, slope length, topographic wetness index, slope aspect, altitude, curvature, distance from river, geology, and land use/land cover. We identified 42 flood points in the area, 70% of which was used for modelling, and the remaining 30% was used to validate the models. The Receiver Operating Characteristic (ROC) curve was used to evaluate the results. The area under the curve obtained from the ROC curve indicates a superior performance of the ANP and EBF hybrid model (ANP-EBF) with 95.1% efficiency compared to the combination of ANP and FR (ANP-FR) with 91% and ANP and OWA (ANP-OWA) with 89.6% efficiency
    corecore